Fast Morphological Analysis of Czech

نویسنده

  • Pavel Smerk
چکیده

This paper presents a new Czech morphological analyser which takes an advantage of Jan Daciuk’s algorithms for minimal deterministic acyclic finite state automata. The new analyser is six times faster than the current analyser ajka concerning the proper analysis, i.e. returning possible lemmata and tags for a given word form, but for some other related tasks is the difference even bigger.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Morphological and Crystallographic Characterization of Nanoparticles by Granulometry Image Analysis and Rietveld Refinement Methods

The particle size distribution of the resultant cobalt ferrite samples was determined from Scanning Electron Microscopy (SEM) images using the granulometry image analysis method. Results showed the nanosized particles of the samples. The X-Ray Diffraction (XRD) patterns of samples were also analyzed by Rietveld refinement method. The results indicated that the precipitated sample at 95 <sup...

متن کامل

Morphological Analysis of Law Texts

In the paper we explore the morphology of the Czech law texts including Constitution, acts, public notices and court judgements which form a huge textual database. As many texts from small domains, the used language is partially restricted and in relevant aspects also different from general Czech. The paper presents first results of the morphological analysis of Czech law texts and their conver...

متن کامل

Using A Range of PVB Spinning Solution to Acquire Diverse Morphology for Electrospun Nanofibres

Morphological changes in Polyvinyl Butyral (PVB) electrospun nanofibres can be acquired by preparation of PVB spinning solution in different solvents.  Accordingly, three solvents, including ethyl alcohol, n-butanol and isopropanol, with diverse physical properties (e.g. boiling point, density, dipole moment and dielectric constant) were used to prepare PolyVinyl Butyral (PVB) spin...

متن کامل

Improving Statistical MT through Morphological Analysis

In statistical machine translation, estimating word-to-word alignment probabilities for the translation model can be difficult due to the problem of sparse data: most words in a given corpus occur at most a handful of times. With a highly inflected language such as Czech, this problem can be particularly severe. In addition, much of the morphological variation seen in Czech words is not reflect...

متن کامل

Merging Data Resources for Inflectional and Derivational Morphology in Czech

The paper deals with merging two complementary resources of morphological data previously existing for Czech, namely the inflectional dictionary MorfFlex CZ and the recently developed lexical network DeriNet. The MorfFlex CZ dictionary has been used by a morphological analyzer capable of analyzing/generating several million Czech word forms according to the rules of Czech inflection. The DeriNe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009